Interpreting Intelligibility under Uncertain Data Imputation

نویسندگان

  • Brian Lim
  • Danding Wang
  • Tze Ping Loh
  • Kee Yuan Ngiam
چکیده

Many methods have been proposed to make machine learning more interpretable, but these have mainly been evaluated with simple use cases and well-curated datasets. In contrast, real-world data presents issues that can compromise the proper interpretation of explanations by end users. In this work, we investigate the impact of missing data and imputation on how users would understand, and use explanation features and propose two approaches to provide explanation interfaces for explaining feature attribution with uncertainty due to missing data imputation. This work aims to improve the understanding and trust of intelligible healthcare analytics in clinical end users to help drive the adoption of AI.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

متن کامل

Influence of Pattern of Missing Data on Performance of Imputation Methods: An Example from National Data on Drug Injection in Prisons

Background Policy makers need models to be able to detect groups at high risk of HIV infection. Incomplete records and dirty data are frequently seen in national data sets. Presence of missing data challenges the practice of model development. Several studies suggested that performance of imputation methods is acceptable when missing rate is moderate. One of the issues which was of less concern...

متن کامل

Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)

Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...

متن کامل

Robust DEA under discrete uncertain data: a case study of Iranian electricity distribution companies

Crisp input and output data are fundamentally indispensable in traditional data envelopment analysis (DEA). However, the real-world problems often deal with imprecise or ambiguous data. In this paper, we propose a novel robust data envelopment model (RDEA) to investigate the efficiencies of decision-making units (DMU) when there are discrete uncertain input and output data. The method is based ...

متن کامل

On Providing Intelligibility-Aware Preservation Services for Digital Objects

Preserving digital objects requires preservation of not only their bit-level representation but also their intelligibility. To this end a digital object should be associated with metadata appropriate for interpreting that object; such metadata are often referred as representation information. Even such metadata may not be intelligible, however, so we may have to associate them with extra metada...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018